APS/123-QED 

Relations between convergence rates 
in Schatten ]>norms 

Paolo Albinj*] and Alessandro ToigcQ 
Dipartimento di Fisica, Universita di Genova, 
Via Dodecaneso 33, 16146 Genova, Italy 

Veronica UmanitaP 
Dipartimento di Matematica "F. Brioschi", Politecnico di Milano, 
Piazza Leonardo da Vinci 32, 1-20133 Milano, Italy 
(Dated: February 2, 2008) 

Abstract 

In quantum estimation theory and quantum tomography, the quantum state obtained by sam- 
pling converges to the 'true' unknown density matrix under topologies that are different from the 
natural notion of distance in the space of quantum states, i.e. the trace class norm. In this pa- 
per, we address such problem, finding relations between the rates of convergence in the Schatten 
p-norms and in the trace class norm. 
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I. INTRODUCTION 



In recent years, the advent of quantum tomography has brought a renewed interest for 
quantum estimation techniques in physics. The underlying physical issue goes back to a well 
known problem raised by Pauli: for a quantum system, find a set of observables such that 
the knowledge of the associated probability distributions completely determines the density 
matrix. Quantum tomography ([?], \^) solves this problem pointing out a fixed family of 
observables (quorum) and giving a formula that expresses the state of the system in terms 
of their probability distributions. Two questions then arise. First, in many applications 
the quorum is an infinite set, and it is clear that in practice one can measure only a finite 
number of observables. Second, and more fundamental, the exact probability distribution of 
a fixed observable in the quorum can not be measured; experiments yield only an empirical 
estimate of it, based on a finite number of samples. Thus, the main practical problem is to 
determine the rate of convergence of the empirical reconstructed state to the 'true' unknown 
state of the quantum system, as the number of samples increases. The natural notion 
of convergence is with respect to the topology induced by the trace class norm. Indeed, 
this is the weakest topology that assures strong convergence of the probability distribution 
of an arbitrary measurement of the empirical state to the probability distribution of the 
same measurement on the true state, uniformly over all possible measurements. However, 
since reconstruction formulas involve convergence in norms (like the Hilbert-Schmidt norm) 
that are weaker than the trace class norm, almost all the results in the literature consider 
convergence with respect to such weaker norms (j!], 4\), usually not giving explicitely the 
rates. In 5|,|6[], the authors find the rates of convergence, but with respect to a nonmetrisable 
topology. 

A result in [2j shows that on the space of states the trace class and the Schatten p-norm 
topologies with p > 1 are equivalent. However, in Simon's theorem a quantitative relation 
between the rates of convergence in the two norms is missing. 

n 

In this paper, following the work of |2[], we collect some useful inequalities relating the 
trace class norm || - 1 1 ^ with the Schatten p-norms ||-|| for p > 1. This is intended as a 
step for passing from the estimate of convergence rates in p-norms to the rates in trace 
class norm. In our main Theorem [H given trace class operators A and A, with A normal 
and H^llx = ||A)|li — 1; we give an estimation of \\A — Ao^ with a bound depending 
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only on \\A — Aq\\ and on the decreasing rate of the eigenvalues fi n of A . In doing so, 
we have in mind the practical situation in which Aq is the unknown quantum state to be 
determined, and A is its empirical estimate, over which we impose only a normalisation 
condition. Manipulating this result, one can find a function /^ , depending only on the 
/i„'s, such that \\A — Aq^ < fA Q (\\A — A \\ ) || A — A \\ . If the decreasing rate of the /x n 's 
is fast enough, then the product /U (||^4 — Ao\\ _) ||A — Ao\\ p converges to as A approaches 
Aq with respect to the p-norm (see example H]). In particular, assuming Aq is chosen in a 
restricted class of physically feasible states, this allows to find rather explicitly the converging 
rate in the trace class norm once the ||-|| p -rate is known. 



II. NOTATIONS 



Let 7i be a complex Hilbert space, with norm ||-|| and scalar product ( • | • ) linear in the 
second entry. B(7i) is the Banach space of bounded operators on 7i, with uniform norm 
|| -H^. If A G B{U), the modulus of A is \A\ = (A*A) 1/2 . 

For p > 1, we denote by B P (H) the Schatten p-ideals in B{7i). We recall that such ideals 
are the Banach spaces 

B P (H) = {Ae B(H) | tr (|An < oo} 

endowed with the norm 

\\A\\ p =[tr(\An} 1/p . 

We have B P {H) C B q {H) and \\A\\ q < \\A\\ p if p < q. In particular, the inclusion B P {H) 
B q (li.) is continuous. The following Holder inequalities hold 

\\AB\\i< U\\ P \\B\\ p/(p -i), \\AB\\i<\\Mi\\B\\oo- 
Bi{H) is the Banach space of trace class operators on 7i. We denote by S(Ti.) the set of 
states in Bi(H), i.e. the closed subset of positive trace one elements. 

If A G B p (7i), then A is a compact operator, so it has a canonical decomposition 

where / = {0, 1, 2 . . .} is a finite or countably infinite subset of N, {v n } n& i and {u n } n& i are 
orthonormal sets in 7i, and \ n {A) > 0. Moreover, 

\\Mi=j2 x ^ A y- 
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In addition, if A is normal, then the spectral decomposition 

A = ) jJL n {A) \u n ) (u n \ 

holds, where {[in(A)} n€ i are the nonzero eigenvalues of A (each eigenvalue appearing in the 
sequence as many times as its finite multiplicity) and | fJ, n (A) \= X n (A). In particular, if 
A is positive, then the spectral and canonical decompositions coincide, i.e. v n = u n and 
X n (A) = fi n (A) are the strictly positive eigenvalues of A. 



III. MAIN RESULTS 



In this section we suppose 1 < p < +00. 

Theorem 1. Let Aq, A e B\(7i) with Aq normal and ||^o|li — ||^4|li = 1- Let 
^2nei ^(Aq) \u n ) (u n \ be the spectral decomposition of Aq. Then 

II A) - < 3N^/ p \\Aq - A|| p + 2 \Vn(A ) I (1) 

n>N 

for all N en. 

Since some technical lemmas are needed, we postpone the proof of Theorem [1] to the next 
section. 

Remark 1. We notice that the hypotheses of the theorem do not ask for either of the two 
operators to belong to S{7i). This is convenient, since not all the estimation schemes lead to 
actual states as estimates: a notable example is the Pattern Function Projection estimator 
4j, whose estimates are in Bi(TC) but not necessarily positive. It is of course very natural 
to see Aq G S(7i) as the unknown state of the system (since it automatically satisfies all the 
hypotheses) and the less subjected to hypotheses A as its empirical estimate. In this setting, 
eq. ([1]) expresses the 1-norm rate of convergence of A to A in terms of the p-rate and of the 
decreasing rate of the eigenvalues of Aq. Anyway, if we know the estimate A to be normal 
(a rather common case in the literature) then in eq. ([1]) we can exchange the roles of Aq and 
A, thus giving a bound depending on the decreasing rate of the eigenvalues of A instead of 
Aq. We remind the reader that in the general estimation framework we may know nothing 
about the estimand state and its eigenvalue behaviour, while we actually have hold of the 
estimator. Thus which of the two possible interpretation is the best (or the only possible) 
choice depends in general on the case at hand. 



In general one can not exactly compute J2 n >N I /-^(^o) |, but, in several cases, we can 
give an estimation of this quantity, as in the following. 

Example 1. Let A G S(H) with eigenvalues A n (A ) < C(n + l)~ a (n G N) for some C > 
and a > 1. Then 

||A - AH, < 3N^ HA) - A\\ p + JL_ 

for all A G Bi(H) with HA^ = 1, JV > 1. 

Indeed, since the map (x + l)~ a is decreasing, we have 

V XJA ) <CV < C / -dx = -, 

n>7V n>7V V 1 JJS-l \ I 

and so the claimed inequality follows by equation ([!]). 
Example 2. Let us consider A G <S(W) with eigenvalues 

A„(A ) < Ce-P" 

and 

">°. c 's(E„ 2 „^")' I = 1 -^ 

Then 

e -pN 

|| A - A||, < 3^-^ || A - A|| p + 2C r -^ (2) 
for all A G with \\A\\ X = 1, JV > 1. In fact 

EW<CEe-* = C JT^— Wj-^. (3) 

n>N n>N V 1 e 1 e / 1 e 

As a particular case, eq. (j2J) applies to the Gibbs state 

-/3n 

z 



A = ^^— |n)(n| (/3>0), 



n>0 

with 

1 r ^ ■ 1 -' = i- e - 



c= i [y; e -'" 



From the above Theorem, we obtain the following. 

Corollary 1. Let A , {A n } ng ^ + &e elements inBi{H) with A normal and || A 1| x = HA^I, 
1, suc/z t/iat ||A — A n || p — > 0. TTien ||A — A n || 1 — > 0. 



Proof. Fix £ > 0. Choose N e such that J2 n >N e I /^(A)) |< £ /4, and i £ such that 
|| A - Ai\\ p < N^~ p)/p e/6 for i > i E . By eq. d for i > i £ we have 

\\A - Ai\\ t < 3N^ 1)/P \\A - Mp + < e- 

This proves our claim. ■ 
In particular, this implies that the topologies induced on S(H) by Bi(TC) and B p (Tl) coincide. 

Remark 2. Corollary [1] is also a simple consequence of Theorem 2.19 in [2j, noting that 
p-convergence implies convergence in the strong operator topology of {A n } n and {A*} n . 
However, in contrast with Theorem [U Simon's result does not give an explicit relation 
between the rates of convergence in the p- and 1-norms. 

Using eq. [U, we can obtain an alternative estimation of the distance \\A — Aq^. 

Corollary 2. Let A G Bi(TC) be normal and \\A \\ = 1. For e > define 

N Ao (e) := min{iV G N | ^ |/i„(A ) |< e}. 

n>N 

Then, ifq = p/(p-l), 

\\A - AH, < (3^iV A) (||A -A|| p ) + 2) p - A|| p , (4) 
for allAe Bt(H) with {{A^ = 1. 

Note that lim £ ^o Na (e) < oo if and only if has finite rank. 

The above inequality is a less strict result than the bound in eq. [T] and its consequence 
in Corollary [TJ In fact, we show in the next example that one can fix A$ G S(7i) such 
that, for any sequence {A n } ne i + in the unit ball in B\(H) with ||A — A n \\ p — ► 0, one has 
(3yiV Ao (||Ao-A|| p ) + 2) || A - A|| p - oo (while, by Corollary £J ||A - AJ, -»■ 0). 

Example 3. Fix a with 1 < a < 2 - 1/p. Suppose A G S(H) with A„(v4 ) = C(n + 
(n £ ff), where C = [£ neN (n + l)~ a ] _1 - Then > 

1 /" + °° 1 C 

nt^ + ^ (* + !)" a-1 
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so that | 

N A Je) > [—7— — r ] a 1 - 1 for all < e < CI (a - 1). 
\e(a-l)J 

Therefore, if {An}mez+ is a sequence in the unit ball in Bi(H) such that \\A — A m \\ — > 0, 
we get 



\im(3^N Ao (\\A - A m \\ p ) + 2) ||A - A m \\ p = +oo. 
However, if some hypotheses are made about the decreasing rate of the eigenvalues of 



A , then (3^N Ao (\\A - A m \\ p ) + 2) \\A - A m \\ p -> for ||A - A m || p -> 0, as the following 
example shows. 

Example 4. Suppose A Q e <S(W) as in Example [2j Since Ce~ l3N /(l — e _/3 ) < £ if and only 
if iV > -/J" 1 ln(C _1 £(l - e -/J )), inequality (ED implies that 

^A (g) < \ ln( -7 ^ _ m ) + 1 forall0<£<l. 



V^l-e-' 3 ) 

Therefore, if we have a sequence {A m } m€ z + in the unit ball in Bi(H) with ||v4 — A m || p — ► 
for some p > 1, we obtain 



lim {^N Ao {\\A - A m \\ p ) + 2) \\Ao - A m \\ p = 0. 
IV. PROOF OF THEOREM [Tj 

Lemma 1. Let A E B\{H) and P be a projection inTC. If Q = I — P, then 

\\PAP\h + \\QAQW, = \\PAP + QAQ\\ X . 

Proof. Let PAP = E„ e /A n (PAP) \v n ) (u n \ and QAQ = £„ 6j A n (QAQ) |^ n ) (w n \ be 
the canonical decompositions for PAP and QAQ respectively. 

Since {u n } ne i, {v n } n& j are orthonormal sets in P(TC), and {w n }nGJ> { z n}neJ are orthonormal 
sets in Q{7i) = kerP, it follows that {u n , w n } n( ziuj is an orthonormal part in 7i, and then 

PAP + QAQ = J2 (K(PAP)\v n )(u n \+\ n (QAQ)\z n )(w n \) 

n£lUJ 

is the canonical decomposition for PAP + QAQ. This means that X n (PAP + QAQ) = 
\ n (PAP)+\ n (QAQ), and so 

||PAP + QAQ\\ 1 = \\PAP\\ X + IIQAQH, . 



Lemma 2. Let A G Bi(7i) and {Pi\f = \ be a family of mutually orthogonal projections in 
Ti. Then 



N 



i=i 



Proof. Set B := J2? =1 PiAPi. 
Let A = X n (A) \v n ) (u n \ and B = J2 m ^m(B) \z m ) ( w m \ be the canonical decomposition 
of A and B respectively. Then 

JV 

X m {B) = { z m | Bw m ) = 'y]( P { z m | AP(w m ) = ^ ^n{A)i 



i=l 



with a nm := ^f =1 ( PiZ m \ v n ) { u n \ PiW m ). Note that, by Cauchy-Schwartz inequality, we 
have 

( \ f \ V2 

^2 1 anm i - 1 /, k PiZm i Vn ) i 2 1 1 i ( Un i PiWm ) i 2 ) 

n \ «,n / \ i,ra / 

(AT \ V 2 / n \ 1 / 2 

^2 \\ p i z ™\\ 2 I I ^ ||PjW TO |l 2 J < ||2; m || \\w m \\ = 1, 

for {v n } n and {w n }n are orthonormal sets and the projections Pi are mutually orthogonal. 
Similarly, J2 m \ ^nm\< 1- 
Therefore we get 

ll-^lli = ^ \X m (B)\= ^ |y^a nm A n (A) |< \a nm \ X n (A) 



m n 



n \ m / n 

As a simple consequence of the previous Proposition and Lemma [1] we have the following 
Lemma 3. If A E BiiTi), P is a projection in Ti and Q = I — P , then 

WPAPW. + WQAQW, < \\AWj_. 
In particular, if P commutes with A, then 

WPAP^ + WQAQW^WAW,. 
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Proof. The second part follows by Lemma [T] since the commutation between A and P 
clearly implies PAP + QAQ = A. m 
We are now in the condition to prove Theorem [U 

Proof of Theorem [H Let P/y = Yln=o \ Un ) ( Un \ De ^ ne projection on the linear span 
of the first N eigenvectors of A , and Qn = I — Pjv- By triangle inequality 

\\A - A\\ x < \\P N (Ao - A)\\ t + \\Q n (Aq - A)P N \\ 1 + WQnAoQ^ + WQnAQm^ 

By Lemma [3] 

WQnAQnW, < 1-WPnAPnW, (5) 
WPnAoPn]^ = 1 - WQnAoQnW! , (6) 
since P commutes with A . Therefore, by triangle inequality 

WPnAPnW, > -\\P N (A -A)P N \\ 1 + \\P N A Q P N \\ 1 

= - \\P N (A - A)P N \\ 1 + 1 - WQnAqQnI^ 

we have 

Po-^IL < \\P N (A -A)\\ 1 + \\Q N {A -A)P N \\ 1 + \\P N {A -A)P N \\ 1 
+2\\Q N A Q N \\ 1 
< 3 \\ p n\\ p/{p -i) II A) - A\\ p + 2 WQnAqQkW, . 

Since ||Pjv|| p /(p_i) = N^-^/p and | Q N A Q N |= Y. n >N I ^(A) \ \u n ) {u n \ by spectral 
theorem, we obtain 

WQnAoQn]^ = \Vn(A)\ 

n>N 

so that eq. [T] follows. ■ 



Remark 3. If in Theorem [T] it is assumed A ,A G S(TC), then the above proof simplifies, 
since Lemma [3] is no longer needed to prove eqs. [61 In fact, in this case || ^4|| x = tr (A) = 
tr(P N AP N )+tr(Q N AQ N ) = {^AP^ + \\Q N AQ N \\ V 
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